Skip to content

upstream: handle health check fail after removal#6765

Merged
mattklein123 merged 8 commits intomasterfrom
fix_eds_race
May 1, 2019
Merged

upstream: handle health check fail after removal#6765
mattklein123 merged 8 commits intomasterfrom
fix_eds_race

Conversation

@mattklein123
Copy link
Member

When using active health checking, hosts are not removed from
dynamic clusters if they are still passing health checks. This
creates a situation in which hosts might not be removed for a
very long time if the sequence is reversed; removal followed by
health check failure. This change handles the second case so that
any time a host is both removed AND failing active health check,
in any order, it will be removed.

This has been an issue "forever" but is more obvious when using
streaming EDS or very long polling DNS.

Fixes #6625

Signed-off-by: Matt Klein mklein@lyft.com

Risk Level: Medium/High. Scary stuff.
Testing: New unit tests.
Docs Changes: N/A
Release Notes: N/A

Loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

cluster membership race when streaming eds data and active health checking

2 participants